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Abstract 

We conjecture a sharp bound on the rate of growth of chaos in thermal quantum 
systems with a large number of degrees of freedom. Chaos can be diagnosed using an 
out-of-time-order correlation function closely related to the commutator of operators 
separated in time. We conjecture that the influence of chaos on this correlator can 
develop no faster than exponentially, with Lyapunov exponent < 27 TkBT/h. We 
give a precise mathematical argument, based on plausible physical assumptions, 
establishing this conjecture. 
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1 Introduction 

Strong chaos, the butterfly effect, is a ubiquitous phenomenon in physical systems, ex¬ 
plaining thermal behavior, among other things. In quantum mechanics, this phenomenon 
can be characterized using the commutator [W{t), E(0)] between rather general Hermitian 
operators at time separation t. The commutator diagnoses the effect of perturbations by 
V on later measurements of W and vice versa. One indication of the strength of such 
effects is 

c{t) = -{[w(t),v{a)f) ( 1 ) 

where (■) = Z“^tr[e“^^-] denotes the thermal expectation value at temperature T = 
A quantum deflnition of the butterfly effect is that C{t) should become of order 
2{VV){WW) for large t, regardless of the speciflc choice of V, IE within an appropriate 
class. In general, we assume E(0) and IE(0) are simple Hermitian operators, describable 
as a sum of terms, each a product of only 0{1) degrees of freedom]^ We further assume 
that V, W have zero thermal one point functions. 

^Traces of finite products of matrix fields are simple by this definition; time evolved operators 
f large are generally not. 
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We call the time scale where C{t) becomes signihcant the “scrambling time” [^[^. 
There is another shorter time scale relevant for chaos, the exponential decay time for 
two point expectation values like {l/(0)K(t)). We call this time scale the “dissipation 
time,” or, when a quasiparticle description applies, the “collision time.” In the strongly 
coupled systems we will focus on, we expect td ~ (3- We also expect general time or¬ 
dered correlators to approach their long time limits after this time scale. For example 
{V{0)V{0)W{t)W{t)) ~ {VV){WW) + 

We can gain some intuition for the relation between C{t) and chaos by studying the 
semiclassical limit of a one particle quantum chaotic system, like semiclassical billiards, 
following the classic reference [^. Schematically, in the semiclassical limit taking V = p 
and W(t) = q{t) the commutator [g(t), p] becomes the Poisson bracket p} = 

This gives the dependence of the hnal position on small changes in the initial position, 
the classical diagnostic of the butterfly effect. Nearby trajectories in such systems diverge 
exponentially, ~ where is a Lyapunov exponent. Here ~ For early times, 
the correlator C{t) ~ so t* ~ ^log|. In this context is called the “Ehrenfest 

time.” There is a parametrically large hierarchy between scrambling and collision times 
determined, in this case, by the small parameter e = h. Systems with such a large hierarchy 
will be the focus of this paper. 

From a purely quantum mechanical point of view we can follow the analysis of and 
use C(t) as a measure of the growth of the operator W(t) expressed as a sum of products 
of simple basis operators. In qubit models these would just be Pauli matrices. A large 
commutator indicates a complicated operator W(t) that arises because chaos disrupts the 
cancellation between the initial and hnal factors in W{t) = If the number of 

qubits Nq is large it will in general take a long time for a large commutator to build up. 
If the interactions are local, the time is linear in the separation between W and V [6]-[^. 
Even if the interactions are nonlocal, but are formed from products of just a few qubits, 
it will take a time ~ log for C{t) to become largej^ The analog of td here is roughly 
the time for W(t) to add a few Pauli matrices, so large Nq qubit systems provide another 
example of a large hierarchy between and td- Clearly these ideas generalize to a wide 
variety of lattice quantum systems. Here 1/e would be the size of the system (in the 
nonlocal case), or an exponential of the distance between the H(0) and IE(0) operators 
(in the local case). 
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^Scrambling in nonlocal quantum circuits was studied in 
conjectured for nonlocal Hamiltonian systems in and supported by a Lieb-Robinson bound in 


A logarithmic scrambling time was 


15 
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In a lattice system, the square of the commutator in C{t) is a reasonable operator, but 
in a quantum held theory it generally requires regularization. A convenient prescription is 
to move one of the commutators halfway around the thermal circle, so that we consider 


-tr [y‘^[W{t),V]y‘^[W{t),V]] 


( 2 ) 


where y is dehned by 



( 3 ) 


and V is always ld(0). A closely related function, and the one that we will work with 
directly in this paper, is 

F{t)=tT[yVyW{t)yVyW{t)l (4) 


corresponding to insertion of the V and W operators at equal spacing around the thermal 
circle. As explained in Fig. F is analytic in a strip of width /3/2 in the complex 
time plane, and at the edges of this strip we can relate F to the regularized commutator 
discussed above. To see this, notice that F{t — i(3/A) = ii[y‘^VW{t)y‘^VW{t)]^ so 


- tr [y‘^[W{t), V]y‘^[W{t), F]] = ii[y'^W{t)Vy‘^VW{t)] + ii[y‘^VW{t)y‘^W{t)V] 

-F(« + 'i^)-F(«-i^). (5) 

We can use this equation to develop some intuition for the time dependence of F{t). 
First, for small f, all terms on the RHS are positive and roughly equal. The commutator 
is small because of a cancellation between the hrst and second lines. The terms on the 
hrst line can be be written as norms of states, e.g. yW{t)Vy~^\TFD) (the state \TFD) 
is dehned below), so they remain of order one at large t. The growth of the commutator 
is therefore due to a decrease in F(t ± i/d/4). This gives us a second quantum dehnition 
of the butterhy effect: at large t, F should become small, regardless of V, W. 

We will give two additional pieces of intuition for the late-time decrease of F. The hrst 
is based on the observation that, as t becomes large, all pairs of operators are separated 
by large intervals along the contours in Fig. [T} This is true independently of r. Notice 
the contrast here between correlation functions with the contour ordering VW{t)VW{t) 
(which decay at large t) and correlation functions with the ordering VVW{t)W{t) (which 
do not). 

The second piece of intuition requires us to introduce the thermoheld double state in the 
Hilbert space of two copies of the quantum system, \TFD) = lW)R- 

For any operator O, we dehne Ol = ® 1, acting only on the L Hilbert space, and 
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Figure 1: F{t) is the correlation function of operators arranged on the thermal circle as 
shown at left. The folds indicate the Lorentzian time evolution to produce W{t). At 
complex time, F{t + ir) is given by rotating one of the pairs of operators by angle 2t\t jf3. 
The center panel corresponds to |r| < /?/4 and the right panel corresponds to r = /3/4. 


Or = acting only on R. As an entangled state, \TFD) has a very nongeneric pattern 
of correlation between L and R. In particular, simple operators are highly correlated, so 
that e.g. (TFD\VlVr\TFD) is large. 

The point of this preparation is that we can understand F{t) as a similar two-sided 
correlation function in a perturbed version of \TFD). Specihcally, F{t) = (T|VlVr|T) 
where 

|T) = (6) 

mn 

For small f, the simple W operator will not signihcantly change the global pattern of 
correlation in the state, so F remains large. However, as t increases, the W{t) pertur¬ 
bation becomes more and more complicated, and the delicate local correlations present 
in the thermoheld double state will be destroyed [^, causing F to become small. This 
perspective makes the connection to the classical butterfly effect particularly clear. 

Note that from the two-sided perspective, the ordering of operators in F is quite nat¬ 
ural. F{t ± i/3/4) has a simple interpretation as a correlator in the thermoheld double 
state, with two operators acting on one side and two operators acting on the other. This 
correlation function is actually time ordered with respect to a two-sided time that increases 
forwards on both sides. However, in the rest of the paper we will reserve the term “time 
ordered” for conhgurations where the order of the operators in the trace coincides with the 
order expected when we view t as a time variable. The distinction is important because t 
runs forwards on the R system and backwards on L. 

Another important class of examples with a large hierarchy between scrambling and 
dissipation scales are the large N gauge theories and related systems that can be studied 
using gauge/gravity duality. Here the number of degrees of freedom is N‘^ = 1/^- For such 


4 


systems, there has been recent progress in computing correlators such as C(t) and F(t) 
using holographic techniques in black hole backgrounds 16, IT,The key element 
of these calculations is the connection of the long time behavior of ([^ and Q to a high 
energy scattering process near the bulk black hole horizon. The center of mass energy 
squared s ~ ^ exp grows exponentially with time, as dictated by the local Rindler 
structure of the horizon 22,23 . The strength of this scattering becomes of order one 
when Gtvs ~ 1 in AdS units. 

For a large N CFT holographically described by Einstein gravity, the methods of 
give (for 13) 


f 27r 

F{t) = fo-f,expjt + 0{N-^) 


(7) 


where /o, /i are positive order one constants that depend on the specihc operators V, W. 
The growing N~‘^ term gives the hrst indication of the butterfly effect, that is, the beginning 
of a rapid decrease of F{t) that takes place near the scrambling time t* = ^ \ogN‘^. The 
dissipation time in such systems is determined by black hole quasinormal modes which give 
td ^ /3 for low dimension operators. So again there is a large hierarchy between scrambling 
and dissipation. 

This result provides the reference point for the following conjecture. 


2 Conjecture 

We conjecture that chaos can develop no faster than the Einstein gravity result ([^ in 
thermal quantum systems with many degrees of freedorrj^ and a large hierarchy between 


scrambling and dissipation.This conjecture is similar in spirit to the rj/S result of KSS 24 


that points to black holes in Einstein gravity as systems with very strong scattering. It is 
a rehnement of the fast scrambling conjecture of which again singles out black holes. 

In such systems, out of time order correlators such as F in (|^ should display the 
following behavior. Well after the dissipation time td, but well before the scrambling time 
they take an approximately constant factorized value F{t) ~ Fd, where 


Fd = tT[y^Vy^V]tT[y^W{t)y^W{t)] 


( 8 ) 


^See also 


20 


21 


for computations using related large c sparse spectrum techniques in d = 2 CFTs. 


'*’In semiclassical billiards this would be the number of cells in phase space. 
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is the product of disconnected correlators. Due to time translation invariance, this is 
independent of t. For example, in a large N system with t independent of N, large N 
factorization implies 

F{t) = {tT[VyW{t)y^]Y + {tT[W{t)yVy^]y + tT[y^Vy^V]tT[y^Wy^W] + 0{N-^) (9) 

The hrst two terms decay to zero for t S> td- In more general systems the role of large N 
is played by the large number of degrees of freedom that cause commutators to be small 
for t -C f*. 

However, due to quantum mechanics and chaos, F{t) cannot remain a constant forever. 
Scrambling causes a commutator to develop and F{t) to decrease. We conjecture that this 
rate of decrease is bounded (for times greater than a time to, which will be discussed at 
length in § 1^: 

j^(F,-F{t))<j(,F,-F(,t)). (10) 

As Kitaev has emphasized, building on |^, if the system is chaotic we expect correlators 
like Fd — F{t) to initially grow exponentially 

Fd-F{t) = eexpXit-\ - (11) 

where Xl might depend on the operators V, W as well as the particular quantum system. 
We will follow Kitaev and refer to X^ as a Lyapunov exponent. (This exponential behavior 
and the factor of e in ( pT| are related to the fast scrambling conjecture of [^.) 

Assuming this form we conjecture the existence of a universal bound 

Xl<j = 27iT. (12) 

In the following section we present evidence motivating this bound. In §|^we give a precise 
argument, based on plausible physical assumptions, establishing it. 

3 Motivation for the conjecture 

A number of lines of evidence led us to this conjecture. These involve the study of large 
N gauge theories, with and without gravity duals. 

Einstein gravity 

In the holographic calculations that use Einstein gravity in the bulk, the 

result Q holds independent of d and independent of the choice of V and W. This is 
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because (i) gravitational scattering is of order Gns (in AdS units) because the graviton is 
spin two, and Gn oc N~^, (ii) gravity couples universally, and (in) s ~ exp because of 
the kinematics of Rindler horizons. 


Higher derivative corrections 

The result ([^ is unchanged if Einstein gravity is modihed by higher derivative corrections 
with a hnite number of derivatives, like the Gauss-Bonnet term This is because such 
corrections do not change the spin of the graviton, so (i) remains true. The relation (Hi) 
also remains correct as long as the thermal state is dual to a black hole with a smooth 
horizon. Notice that the situation here is different than for the rj/S calculation, where 
higher derivative couplings can move rj/S above and below the reference Einstein value of 
l/dvr |2§|^ . This suggests that a sharp bound might exist for 


Weak coupling 

If the gauge theory is weakly coupled, with ‘t Hooft coupling A independent of N, the 
intuition described in suggests that because the strength of gluon scattering in the 
gauge theory is of order A at small A, the Lyapunov exponent should be small, A^ ~ A//3, 
parametrically smaller than in the gravitational limitWe expect this to be true in any 
weakly coupled theory. 


Stringy corrections 


In a bulk weakly coupled string theory in a geometry with large radius of curvature, the 
hrst corrections to the Einstein gravity calculation of scrambling can be computed using 


the perturbative string theory techniques of 29 . For planar or spherical horizons. Ref. 19 
showed that 


A 




(13) 


where Ig is the string length, and /i^ is a constant that appears in the equation for a shock 
wave propagating along the horizon. This equation involves the transverse dimensions 


^ For the particular case of Rindler AdS black holes (hyperbolic black holes at temperature /3 = 27r) it 
was indicated in 19 that Al is the same as the Regge intercept j{t = 0) — l in the gauge theory, which can 


be computed using the BFKL analysis and is of order A at small A. See the discussion in 28 . This case 


is discussed in more detail in Appendix A. It was also suggested in that a modification of the BFKL 
weak coupling calculation would allow the calculation of A^ at weak coupling in more general cases. 
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and is of the form — jj^)h = 0. Here h is the shock wave prohlej^ (If the horizon is 


hyperbolic, we replace by in (13), where is the lowest eigenvalne of — V^^.) 

Einstein’s eqnations imply that is positive if the transverse space warp factor grows as 
one moves away from the bifnrcation snrface in a spacelike direction 30,31 This will be 
the case if this horizon corresponds to a wormhole-like conhgnration. This is the geometry 
appropriate to the dnal of a thermal held theory. 


Scattering bound 


Becanse of the Rindler relation between bnlk scattering energy and time s ^ exp the 


bonnd (12) is eqnivalent to the bnlk statement that the eikonal phase d is of order Gjvs^ 


and p < 1. (A spin J held exchanged in the Mandelstam t-channel gives p = J — 1). The 
anthors of 32 argued that in scattering p must be < 1 because causality requires to be 
analytic in the upper half of the complex s plane and unitarity requires < 1 there. 

This is consistent with our conjectured bound and suggests that unitarity, analyticity 
and causality are the crucial assumptions necessary to prove the bound. We work in 
Hamiltonian systems where unitarity and causality are manifest, and correlation functions 
are analytic. Because of the relation between s and time t, a natural strategy is to formulate 
a bound on F in the complex t plane. 


4 Argument 


In this section we will provide a two-part argument for the bound. The hrst part consists of 
a simple mathematical result bounding the derivative of any function that satishes certain 
assumptions. The second part consists of physical arguments that F should satisfy closely 
related assumptions in the systems of interest. The resulting bound on the derivative 
implies (10). 


4.1 A mathematical result 

Suppose we have a function f(t) with the following properties: 

1. f{t + iT) is analytic in the half strip 0 < f and — | < r < (Here t and r are the 
real and imaginary parts of the complex number t + ir.) We also assume that f{t) 

^We are grateful to Mark Mezei for discussions on this point. 


2 _ d{d—l) 


“For the particular case of a planar AdSd+i black brane we have = W- 
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is real for r = 0. 


2. \f{t + ir)| < 1 in the entire half strip. 


Then we claim that 


1 

w 


d[_ 

dt 




( 14 ) 


Before presenting the proof, it is usefnl to consider the example /(f) = 1 — Here 

it is easy to see that the above properties imply the bonnd Al < ^. 

To establish the claim in general, we hrst map the half strip to the nnit circle in the 
complex plane using the transformation 


1 — sinh 

+ 

. 

1 _ 1 

1 + sinh 

+ 

. 

1 _ 1 


(15) 


Then f{z) is an analytic function from the unit disk into the unit disk, thanks to the 
second property. Such functions cannot increase distances in the hyperbolic metric (the 
Schwarz-Pick theorem). The hyperbolic metric is = 4dzdz/{l — |^p)^, so we must have 


M/I 


< 


M^l 


i-|/(^)P i-MP' 

We apply this inequality for r = 0 where / is real, hnding 


1 -/ 


dt 


271 , / 27rf 

< J COth ( — 


(i±/) < 

2-/3 ^ ’ 


(16) 


(17) 


which is the claim (14). 


4.2 Deriving the bound 


If we could show that F{t)/Fd satishes properties one and two, above, then (14) would 

(18) 


imply the conjecture (10). Recall that Fd is the disconnected correlator 

Fd = ii[y^Vy^V]ii[y^Wy^W]. 


The hrst property is easy to establish. The meaning of F for complex times is most 
simply understood from Fig. but we can also write it out explicitly as 


F{t + ir) = (19) 
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For finite N and finite volnme the RHS defines an analytic function in the strip |r| < /d/4, 
even in quantum held theory. We also see that when t = 0 F{t) is real. (Recall that W 
and V are Hermitian operators.) Therefore the hrst property holds in general. 

The second property is more subtle. In fact, we will only show that \F(t + iT)\ < F^ + e, 
for an appropriate e, and for times t greater than a reference time to- This will allow us 
to apply the result from the previous section to the function 


fit) = 


F{t + to) 
Fd + £ 


( 20 ) 


Provided that £ is small, this will give us the bound (10) up to small errors, for times 


greater than to- We will derive conditions on e: and to in the process of arguing that / 
satishes property two. 

Our strategy will be to show that |/(t+ir) | < 1 on the three boundaries of the half strip 
0 < t and —/d/4 < r < /3/4, and that / is bounded by some constant everywhere in the 
interior. Then the Phragmen-Lindelof principle (the analog of the maximum principle for 
non-compact regions) implies that the function actually obeys \f{t -|- ir)| < 1 everywhere 
in the interior, establishing the second property. 

First, we consider the edges of the half strip |r| = /d/4. Notice that 


F{t - i(3/A) = ti[y‘^VW{t)y‘^VW{t)]. 


( 21 ) 


The RHS can be viewed as an inner product of “vectors” [yVW{t)y]ij and [yW{t)Vy]ij 
{W and V are assumed Hermitian). The Cauchy-Schwarz inequality then give^ 

\F{t - i/d/4)| < tT[y‘^W{t)Vy‘^VW{t)]. (22) 


In a chaotic system with many degrees of freedom, and for times large compared to the 
dissipation timescale, we expect that the RHS factorizes and is given by Fa- This is the 
main physical input to the argument. To make the possible error explicit, we dehne £ by 
the condition that for all t > to, we will have 

tT[y^W{t)Vy^VW{t)] < tT[y'^W{t)y'^W{t)]tT[y^Vy^V] + e . (23) 


In general the size of £ will depend on to- In systems where we can take £ small while 
keeping to ^ t*, we will get a good approximation to the bound (10) once F^ — F{t) 
exceeds £. We will analyze £ and to in some example systems in the following sections. 


®Note that at leading N ^ order, the Einstein gravity result (7) saturates this bound. 
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For the present purposes, the important point is that with the dehnition of e in (23), the 
Cauchy-Schwarz inequality ensures that |/| < 1 on the edges |r| = (3/A. 

Next, consider the third boundary at t = 0. This corresponds to F(to + it) with 
~/9/4 < T < /9/4. Here the possible error in factorization has two sources. One is the 
failure of the time-ordered correlation function to factorize, which is order e. The other is 
due to the fact that F is not time-ordered; F will begin to move from its factorized value 
due to the onset of scrambling. In general, we expect this to cause F to decrease, but it 
is not necessary to assume this. As long as we choose to early enough that the effect of 


scrambling is smaller than the e dehned by condition (23), the second error will be smaller 


than the hrst, so |/| < 1 on the third boundary as well. 

To complete the argument for property two via the Phragmen-Lindelof principle, we 
need to establish that / is bounded in the interior by some constant, \f{z)\ < C, where C 
might be bigger than one. Again, we apply the Cauchy-Schwarz inequality, viewing F as 
the product of two vectors. Choosing the vectors appropriately we hnd (for positive r) 

\F{t + zr)| <tT[y^+Wy^-^Wy^+^Wy^-W] 

r~^tT[y^^Wy^-'^V]tr[y^^'^Wy^-^W] (24) 

<tT[yVy^V]tT[yWy^W] 

with 7] = ^. In the second line we have again invoked factorization at late times for time 
ordered correlators. (All the W are evaluated at time t and V at time zero.) The third 
line uses Hermiticity of V, W and the contracting property of y. What appears on the 


RHS is not the same as Fd, since we have fewer powers of y compared to (23), but it is 
hnite, so we have established property two. 

Finally, let us address a slight imprecision in our discussion. In the above we assumed 
that the largest times we would talk about are of order the scrambling time, which are 
logarithmic in the small parameter. On the other hand, after very large times we can have 
Poincare recurrences, and we expect factorization to fail. To avoid this we can cut off the 
half strip by adding an additional boundary at a time much larger than the scrambling 
time but much smaller than the recurrence time. At this additional boundary we need to 
have \F\ < Fd + e. In a chaotic system, we expect F to be very small for almost all times, 
so it should be easy to hnd a suitable time for the cutoff]^ The conformal transformation 
from this hnite strip to the disk will be more complicated, but it will coincide with the 
one we used in the region of interest for our arguments. 


^Assuming incommensurate energies, one can show that the long time average of F is exponentially 
small in the entropy of the system. 
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We conclude that / in (20) satisfies properties one and two from the previous section. 


The mathematical result (14) then implies that for t greater than plus a few thermal 
times, we have 

j^(Fi-F(t))<j(Fa-F(t)+E). (26) 


Here, to recap, £ is the maximum error in the time ordered factorization (23) for times 


t > to. For different systems, we might make different choices of £ and to, in order to get 
the best bound. Examples will be discussed below. The essential point is that for a wide 
class of chaotic systems where V, W are small perturbations, we expect the scrambling 
time to be large, and we expect factorization to hold up to small errors after a time to 


with to ^ f*- For such systems, the result (25) implies the bound (10) for the growth of 


Fd — F{t) once this quantity exceeds the small error. 


4.3 Examples 

4.3.1 Large N systems 

In large N systems we can take V and W to be single trace operators and exploit large N 


factorization. The error e: in the estimate discussed in (23) is then given by 


£ = (tTlVyW{t)y‘]f + (ti[W(t)yVy^\f + 0(N-'‘) . 


(26) 


For general V and W these off diagonal expectation values are nonzero but decay because 
of dissipation, leading to an estimate s ~ N~‘^ + We must now choose to- In order 

to get the best bound, we set £ equal to the growing effect of scrambling on F(t) at time 
to- As an example, suppose that Fd — F{t) is proportional to Then the optimal to 

is given by t*/(1 + y^), and we have £ ~ e^Ltd/{^+>^Ltd)_ Once F^ — F{t) exceeds this value 


near the time to), (25) implies the bound (10). 


We can get a bound for a wider range of times if the system has a global symmetry 
like parity and we choose V and W to transform differently under it. Then the first two 
terms above vanish and e ~ N~^. This means we can take to = 0 and still make chaotic 
effects dominate over the error. Because chaos is almost by definition generic even special 
operators will couple to the basic chaotic dynamics of the system so we can apply the 
bound to the very early development of this chaos. In particular, by integrating ( [TT] ) from 
early time we find 

^ f 27^ \ 

( 27 ) 


F, 




where c is an order one N independent constant. 
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4.3.2 Extended local systems 


For lattice systems, or for thermal quantum field theories, the large number of degrees of 
freedom comes from the fact that we have an extended system. We can take V to be an 
operator at the origin and W to be an operator at a site at large distance L. For such 
systems, we get an interesting bound by setting to = 0. Then e is equal to the maximum 
over t of 

ti[y^W{t)Vy^VW{t)]-ii[y‘^W{t)y‘^W{t)]ii[y‘^Vy‘^V]. (28) 


o 


in general, because of the short range 


At t = 0, we expect the above to be 
correlations in the thermal state. 

In special systems (such as those discussed below), this factorization might break down 
for times t oc L, due to the possibility of signalling between W and V. However, for generic 
chaotic systems at finite temperature, we expect that signals should be exponentially 
suppressed in distance, so that the difference in (28) is < for all time. We can then 
take e = As before, the bound (25) implies (10) once Fa — F{t) exceeds this small 

value. Note that this may take a long time if L is large. 


4.3.3 Cases where there is no bound 

There are local systems for which factorization {VW{t)W{t)V) ~ iyV){WW) does not 
hold for an appropriate range of times, even for widely separated V, W. For example, 
consider a massless free field 0 in two dimensions and take the operators to be H(0) = 
d_0(O) and W{t) = d-(j){L — t). Even if L is large, the contraction between V and W 
becomes important for t ^ L. This is the same time at which the commutator becomes 
nonzero, so we cannot bound its growth. 

Indeed, in this system the commutator is [V, Wit)] oc 5'{L — t), which rises very fast, 
independently of the temperature]^ 

There is a related issue in any two dimensional conformal held theory. Such theories 

contain a stress tensor operator T_(x“) which has singularities along the light-cone. 

Taking V = T _and W some other local operator we hnd that factorization fails near 

the light cone. However, at large c, this is suppressed by 1/c and (after smearing) can be 
absorbed within the small e that we are tolerating. 

In fact this is a problem specihc to two dimensional systems where the light cones are 
one dimensional so signals cannot spread around them. In higher dimensions this is not 

can smear the operators a bit, but we retain the same conclusion: the growth is determined by 
the parameters of the smearing function rather than the temperature. 
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an issue and the commutators are suppressed at large spatial separation. This is easy to 
see in free theories, and for general conformal held theories, as explained in Appendix A. 

In addition to the factorization assumption, we have assumed that there is a large 
hierarchy between the dissipation time and the scrambling time. We have justihed this on 
the grounds that we have many degrees of freedom and that the Hamiltonian is built from 
hnite products of simple operators. Alternatively, one could consider a Hamiltonian given 
by a random Hermitian matrix. For such a system, we expect no such hierarchy, so our 
conjecture does not apply. 


4.3.4 Rindler space and the scattering bonnd 

Field theories on Rindler space are simple examples of thermal systems. In this case the 
Minkowski vacuum is the thermoheld double state. For the case of conformal held theories 


in d > 2, one can prove that the bound (25) holds with small £ for Rindler correlators 
of well separated operators. This follows from the fact that the correlators are related to 
Minkowski vacuum four point functions which can be approximated using the operator 


product expansion. For theories with gravity duals this implies the scattering bound 32 


mentioned in § [^ We discuss this point more extensively in appendix A. This appendix 
also serves as a worked out example of the considerations in this paper. 


4.3.5 Semiclassical billiards 


At hrst sight one might think that a classical system could violate the bound since classical 
Lyapunov exponents can take any value. However, restoring dimensionful factors, our 
conjecture is 


Ar < 


27ikBT 

h ’ 


(29) 


SO there is no contradiction in the strict classical limit h —?• 0. 

It is interesting to consider a semiclassical chaotic system with a small h at hnite tem¬ 
perature. For such systems, we can take £ ~ -v^^ith the large N case. The analysis 

is as before. One can also give a direct (although heuristic) argument for a bound, following 
reasoning in 24 . Consider a semiclassical chaotic system such as interacting quasiparticles 


or stadium billiards. A naive dehnition of the Lyapunov exponent is the inverse of the 
timescale Tni over which the evolution of a particle becomes nonlinear. For example, 
would be proportional to the mean free time for a system of interacting quasiparticles, or 
the time to cross the stadium for a billiards problem. To violate the bound, we would need 
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Tni ksT < h. Since ksT is the typical energy, we would need a violation of the energy-time 
uncertainty principle, indicating that the semiclassical description is invalid. 


5 Concluding remarks 


We have given a strong argument for a bound on the rate at which chaos can develop in 
general thermal quantum systems with a large number of degrees of freedom. The large 
number of degrees of freedom suppresses the initial size of the commutator causing strong 
chaos-scrambling-to develop parametrically later than dissipation. We diagnosed chaos 
using an out of time order correlator F{t) related to a commutator. Characterizing this 
growth in terms of a Lyapunov exponent, we claim that it is bounded by 


Al < 


2nkBT 
h 


(30) 


where T is the temperature of the system. 

Our direct argument for this bound relied on analyticity, as well as the physical input 
that certain time-ordered correlation functions should approximately factorize. We gave 
arguments justifying this factorization for different classes of physical systems with many 
degrees of freedom. In the general case, these arguments also relied either on large timelike 
or spacelike separation between operators. 

It is tempting to speculate that a large N system which saturates this bound will 
necessarily have an Einstein gravity dual, at least in the near horizon region. This is in 
the spirit of the speculation in 33 that a system with no light higher spin single trace 
states should have a gravity dual. 
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A Rindler space and the scattering bound 

The Rindler construction gives simple examples of thermal systems. We consider a CFT^ 
on Minkowski space and choose Rindler coordinates ds^ = —p^dt^ + dp^ + d,x]_ 2 - The 
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X x + 



Figure 2: Consider a two dimensional Minkowski space, with its right and left Rindler 
wedges. We insert two operators in the right Rindler wedge and two on the left. Then we 
act on Wl and Wr by the Rindler time translation generator, which is a boost around the 
origin. This translates upwards in time and Wl downwards in time, as shown by the 
arrows. 


Minkowski vacuum corresponds to a thermal state on Rindler space. These coordinates 
cover the right Rindler wedge. There is an identical set of coordinates which cover the 
left Rindler wedge, see £gure[^ The Minkowski vacuum can be viewed as the thermofield 
double, entangling these two systems. We can now apply our general discussion to the 
particular case of a Rindler wedge. In this context the function F(f±i/?/4) corresponds to 
an ordinary Minkowski space four point function. More precisely, imagine that we choose 
all four points inside a two dimensional subspace of the full space. Let us insert 

the four operators as shown in hgure with the points 

x]*" = ±1 , = =f1 , ^3 = , X 4 = (31) 


The cross ratios then become 


z± = 


•^ 12-^34 

1 — 

X14X32 


cosher 


(32) 


Here we have used the label t, as in the rest of this paper, to denote the flow by the Killing 
vector generating Rindler time translations. Note that this flows backwards in time on the 
left Rindler wedge, see hgure 

All four point functions of conformal primaries V, W can be computed by analytically 
continuing the hat space euclidean correlator, with suitable ie prescriptions 34,20 . The 


ie prescription that gives rise to the F correlator is the one that is natural from the point 
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of view of Minkowski space. More precisely, the correlator F{t + 1(3/4) corresponds to a 
correlator in Minkowski space with the standard time orderinj^ see hgure m 

F{tFtl3/A)=ii[W{t)Vy^W{t)Vy^] = (0|fhB(t - - ze)|0) (33) 


where we have not bothered to introduce ie’s for operators that stay spacelike separated 
as we change t. On the other hand, a correlator that naturally factorizes at large times is 
given by 

tT[VW{t)y^W{t)Vy^] = {0\WL{t + te)Wn{t - te)VnVL\0) (34) 

For t = 0 and cr 0 these correlators are equal. They are in the Euclidean OPE region 
in the VV channel (or 12 channel). We will now keep a hxed and increase t. Increasing 
t we pass through a point where two of the operators are null separated at t + a = 0. 
At this point = 1. This is a singular point for the four point function. By suitably 
smearing the operators we can remove the singularity. Notice that, for cr <C 0, the other 
cross ratio, z_, remains small throughout the discussion. Therefore, using the OPE in the 
VV channel, we can expand the correlators in a series of the form 


E 


A+SA-S^ 

Z_^ Z_ Ca,S 


(35) 


where A and S are the dimension and spin of the intermediate operators. Since is 
small, after smearing in z^, we can apply a uniform bound for this quantity when d > 2, 
since unitarity implies thatp^ A — S > This holds on the hrst sheet of the z^ plane. 
The ie prescription in (34) implies that z+ remains on the hrst sheet as we change t. But 
for (33) we circle around the branch cut at = 1, which changes the behavior when we 
return to —)■ 0. In conclusion, we hnd that by taking V and W far away in space we 
ensure that (34) factorizes as indicated in (23) for all times. Therefore the bound (25) is 
a theorem in this situation. 


The dissipation time td is just the inverse of the smallest A in (35). The manifest 
lack of recurrences here can be interpreted thermally as due to the inhnite entropy of the 
thermal system on Hd-i- As we remarked above, here the Lyapunov exponent is the same 
as the BFKL intercept Xl = j{t = 0) — 1 28,19. The high energy nature of the process 
for large t is apparent from hgure 


Recall that the ie prescription for any ordered Minkowski correlator (0|O(x„) • • • 0(a;2)0(a;i)|) is that 
we add ^ — iei with < e^+i. Note, however, that the shift in Minkowski time to —ie in the left 

wedge translates into a shift into the +ie direction in the t coordinate due to opposite flow of time there. 


^^The exceptions in two dimensions pointed out in § 
A = S there, like the stress tensor. 


4.3.3 


follow from the existence of operators with 
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We now consider large N CFTs which have an Einstein gravity dual. We can extend 
the Rindler coordinates through the bulk and we can view the resulting space as a zero 
mass hyperbolic black hole, or a two sided hyperbolic black hole. The bulk scattering that 
is dual to chaos here is just high energy gravitational scattering in vacuum AdS space. 
More precisely F is computed by folding bulk to boundary propagators against the bulk 
gravitational scattering amplitude 18,19 . When the scattering is weal^^ the propagator 
variation is a small effect and the rate of decrease of F directly diagnoses the size of the 
eikonal phase 5(s). The bound (25) shows that this phase cannot increase faster than s. 

This is an alternate derivation of the scattering bound in that helped motivate 
this work. More precisely, we get the bound |1 + i5(s)| < 1 + 0(6“^) in the upper half s 
plane, when 6{s) is small but of order one. This bound also implies the positivity of the 
Shapiro time delay. This is a nontrivial constraint for classical Gauss-Bonnet theories, it 

rules them out as classical theories 32 . The exchange of a spin J held in the Mandelstam 

.J-i 


t channel gives (5(s) 


~ s 


Then the bound (25) rules out any weakly coupled large 


radius bulk theory with a hnite number of light particles with spin greater than two. 
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